Bernoulli 15(4), 2009, 1259-1286 
DOT: 10.3150/09-BEJ190 



Evaluation for moments of a ratio with 
application to regression estimation 

PAUL DOUKHANi and GABRIEL LANG^ 

^ UMR 8088 Analyse, Geometric et modelisation, Universite Cergy-Pontoise, 2, avenue Adolphe 
Chauvin, 95302 Cergy-Pontoise Cedex, France. 
E-mail: paul.doukhan@frce.fr 

^ UMR 518 Mathematique et Informatique appUquees, AgroParisTech ENGREF, 19 avenue du 
Maine, 75732 Paris Gedex 15, France. 
E-mail: gabricl. lang@cngref. agroparistech.fr 

Ratios of random variables often appear in probability and statistical applications. We aim to 
approximate the moments of such ratios under several dependence assumptions. Extending the 
ideas in CoUomb [C. R. Acad. Set. Paris 285 (1977) 289-292], we propose sharper bounds for the 
moments of randomly weighted sums and for the L^-deviations from the asymptotic normal law 
when the central limit theorem holds. We indicate suitable applications in finance and censored 
data analysis and focus on the applications in the field of functional estimation. 
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1. Introduction 

We consider statistics with the form of a ratio. This situation arises naturally in the 
following simple and generic example. Let (K:,W^i)i>o be a stationary sequence with 
values in a finite space V x W, with V C M. A conditional expectation can be expressed 
as 

Two examples of statistical ratios can be derived from this expression: 

• For discrete random variables, it is empirically estimated from a sample (Vi, Wi)i<i<„ 
by the random quantity 

WfU lU/ ^ l/nJ:tly^■HW^^w} 
E{Vo\Wo=w) = 1/ ■.J... ^• 

^/n}2i=l^W^=w} 
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• The case of real-valued data is more involved as T{{v,w}) has no rigorous mean- 
ing, but standard smoothing techniques allow us to consider extensions. Replacing 

= w} by an approximate Dirac measure 5n(w,Wi), the estimate corresponds 
to the Nadaraya- Watson kernel estimator that will be studied in detail in the sequel. 

In this paper, we consider a ratio of two empirical quantities, namely 
- TV - 1 " - 1 " 

Rn^^—, Dn = — y Uin,Nn= — y UinVin, (1) 

-^n 1=1 i=l 

Ui_n and Vi^n being two arrays of random variables. Examples of this are: 

• Functional estimation of a conditional expectation: let {Xi,Yi) e R'' x R be a sta- 
tionary process and K a kernel function. If we define 

U,,n = K{{X, - x)/hn)/hi, V,,n = Y,, 

then Rn = r{x) is an estimator for r{x) ~ E{Yi\Xi ~ x) and hn — > 0, nhf^ — > oo as 
n—i'oo; see Tsybakov [28] for a general setting and Ango Nze and Doukhan [3] for 
dependent data cases. 

• Computation of empirical means for censored data: let the censoring Ui ~ Ci G {0,1} 
be independent of a process {Vi) and assume that Vi is observed if and only if Ci = 1. 

^" " #{*e{i,7i}|C, = i} ^ 

l ' J I » J i<.(<„^Ci = l 

A example of this situation is the estimation of covariances of a process X under 
censoring where Vi = XiXi+i. Under stationarity, the covariance function is 7x(^) = 
1y[(-)/ {lc{^) +ECg), where Yi = CiXi is observed. Furthermore, moments of the 
empirical covariances are used to build the periodogram from the censored data. 

• General weighted sums 



may be used to model various quantities like prices, with prices per unit Vi and 
volumes Ui, as in [20]. 

Various alternative questions also involve a division: 

• Functional estimation of point processes. A compound Poisson processes (CPP) can 
be expressed as ^ = X]j=i '^j^Xj for some Poisson variable N and some random pro- 
cess {aj,Xj)j>i, a.j > 0, Xj G R'^. For a sequence of mixing couples of CPP {$i,r]i)i>i 
with /i = E?/i = E^i , Bensaid and Fabrc [5] estimate the Radon-Nikodym den- 
sity If = d-ii/dv with kernel estimates ipn = Qnl jm with /„(a;) = ^^"=1 * Kn{x), 
9n{x) =YJi=i^i^ Kn{x), whcrc, for example, £,i^Kn{x) =J2f=i^jKn{Xj - x) with 
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Kn{x) = K{x/h)/h'^. The procedure is thus analogous to the Nadaraya- Watson esti- 
mator. Quadratic errors of this ratio are bounded under the assumption |<p„(x)| < C 
that can be easily relaxed using our result. 

• Self- normalized sums, for example, in [11]. 

• Simulation of Markov chains and the Monte Carlo MC technique widely developed 
in the monograph of Robert and Casella [23]; a more precise reference is Li and 
Rabitz [21]; see relations (45)-(47), which explicitly involve ratios for reducing the 
dimensionality in a nonparametric problem. 

• Particle filtering, considered from the theoretical viewpoint in [12], and for applica- 
tions to change point problems in [18]. 

Deducing the convergence in probability of the ratio from the convergence of the denomi- 
nator and numerator is straightforward, but in some statistical problems, L^'-convergence 
has to be checked. Evaluating the moments of i?„ is much more difficult, even if one knows 
sharp bounds of moments for both the numerator and the denominator. Curiously, we did 
not find many references on this subject. One method is to compute the exact distribution 
of the ratio, as Spiegelmann and Sachs [25] did for the moments of a Nadaraya-Watson 
regression estimator with {0, l}-valued kernels. In this case, independence allows the use 
of binomial-based distributions. However, such computations are generally difficult to 
handle. An alternative is the expansion in [6]. We addressed this problem for a depen- 
dent data frame in the paper [7] , published after Gerard Collomb's death. In [6] and [7] , 
CoUomb assumed that convergence rates in for q> 2p are known for the denominator. 

This limitation is avoided here by using an interpolation technique and we shall only 
assume such rates for some q> p. With the notation (1), we set 

Nn=EN,„ Dn=ED„ and Rn = ^. (2) 

We aim to provide L^-rates of convergence to of the expression 

A„ = Rn — Rn ■ (3) 

In some of our applications, the expectations Nn and Dn are constant. In other cases, 
they converge to some constants N and D as n oo, and the moments of the ratio may 
be proven to converge with the bound 



Rn ^ 

D 



<l|A„||p- 

P 



Rri 

D 



Convergence in probability or a.s. is immediate, but to obtain moment bounds, one has 
to divide by a non-zero expression; for simplicity, from now on, we will assume that 
Ui^n > 0. The previous expression is then also a weighted sum 



1=1 ^-^J — ^ z— 1 
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so that Rn belongs to the convex hull of (l^,n)i<i<Ti- 

The paper is organized as follows. Section 2 is devoted to the main lemma and com- 
ments. The two following sections are dedicated to its applications: to simple weighted 
sums in Section 3 and to the Nadaraya- Watson kernel regression estimation rn{x) of 
a regression function r{x) = ¥.{Y\X = x) in Section 4. The latter is divided into two 
subsections: the first subsection directly applies the lemma to provide the minimax 
bound ||rvi(a;) — r(x)||p = 0{n~P^^^'''^'^^) in an estimation problem with dimension d 
and regularity p less than 3; the second subsection makes use of the same ideas with 
a slight modification to derive the (also minimax) bound || sup^.^^ |f„(a;) — ?'(a:)|||p = 
©((n/lognj-P/^^p+d)^ Q^pj. ^ suitable compact subset B cR'' (see Stone [26] and [27]). 
We prove our result under various dependence assumptions: independent, strongly mix- 
ing, absolutely regular or weakly dependent (cither causal or non-causal) sequences. 
The last section includes the proofs; it consists of four subsections, devoted to the main 
lemma, weighted sums, moments of Nadaraya- Watson estimation and sup bounds of this 
estimator, respectively. 



2. Main lemmas 

Lemma 1 means that for q slightly larger than p, the rate of the qth order moment of the 
denominator and of the pth order moment of the numerator allow us to derive a bound 
of the rate for the pth order moment of the ratio. 

Lemma 1. Assume that \\Nn — Nn\\p < fn and \\Dn — Dn\\q < I'n for some q > p. If, 
moreover, ||C/i,n^i,n||r < C*!! and \\Vi,n\\s ^ Cn, where q/p — q/r>l, l/p>l/q + l/s, then 

where a, j5 are chosen from the parameters p, q, r, s by setting 

\P s qj \p r 

Remarks. 

• In all of our examples, c„ = c will be a constant. 

• If C„ = C is also a constant, then we assume that r ~ so that /? = 1. In this 
case, large values of q give r close to (and larger than) p; if now q> p is very close 
to p, then r needs to be very large and s even larger. 

This is the situation for weighted sums or censored data questions. 

Here, Vi,n — Vi and < Ui_n = Ui, and the sequence {Ui,Vi) is stationary. More- 
over, Vn = cj ^Jn and thus A„ = Oirr^l"^) if a = 2/s. This condition can be expressed 
as s = p{q + 2)/{q — p), as proved in the forthcoming Theorem 1. 
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• When the sequence C„ is not bounded, in order to control the corresponding term, 
we shah use an exponent /3 < 1 and assume that l/p > l/q + 1/r. The order q 
of the moment of the denominator should be larger, as well as the order of the 
moments of the variables Vi^n (in the case of functional estimation, for example). 
Here, f„ = c/ \/nh^ ^ Vv^ a-s h„ — >-„_).oo and n/ij^ — )-„_).oo oo- 

• Orlicz spaces. Instead of L'^-norms, we may consider Orlicz norms and ask only for 
xP log"^ x-ordcr moments of the denominator. Exponential moments of the variables 
Vi n would be used because of the relations (4), (14) and of the Pisier inequalities 
(15). 

• Suprema. The same equations (4), (14) and (15) arc adapted to derive bounds of 
suprema for moments for expressions involving an additional parameter; an emblem- 
atic example of this situation is the regression estimation given in Section 4.2. 

We consider two distinct classes of applications in Sections 3 and 4, devoted respectively 
to weighted sums and nonparametric regression. The following inequality is essential to 
bound the uniform rates of convergence of a Nadaraya- Watson regression estimator and 
it is thus presented as a specific lemma. 

Lemma 2. Letting < a < 1, we have 

Dn\An\<\Nn-N,,\+^-^\D,,-D,,\+ max ' ,"' . (4) 

Inequality (4) also implies tail bounds for A„'s distribution. 

-convergence may also be addressed as follows, as suggested by an anonymous 
referee. 

Corollary 1. Assume that u~^A„ -T-„-i.oo Z converges in distribution to some Z such 
that \\Z\\p> < oo for some p' <p. If the conditions in Lemma 1 hold and the hound is such 
that II A„||p < cv„ for some c > 0, then 

v-'\\A,,\\,, \\Z\\p,. 

n— >oo 

Remarks. 

• For weighted sums, a central limit theorem is obtained in all the cases considered 
below so that the result applies. 

• For the Nadaraya-Watson estimator, Ango Nze and Doukhan [1] prove that 
^Jnh'^{r{x) — r{x)) — >■ N{a{x), b{x)) for functions a{x), b{x). For the case of bounded 
regressors, the result also holds for weakly dependent cases (see Doukhan and 
Louhichi [14]). 
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3. Weighted sums 

We consider here the simplest apphcation of Lemma 1. Let {Ui,Vi)i^z be a stationary 
sequence and set D„ = J2^=i Ui/n, Nn = J27=i UiVi/n. Then Nn = N = ¥.UiVi, £>„ = 
D = EUi and Rn = iV„/f)„, Rn^R = N/D. 

Theorem 1. Let {Ui,Vi)i^z be a stationary sequence with Ui > (a.s.). Let <p<q 
and assume that \\UiVi\\r < c for r =pq/{q~ p) and ||l^|js < c for s = p{q + 2)/{q — p). If 
the dependence of the sequence {Ui,Vi)i(z^ is such that 



then \\Rn- R\\p = 0{n-'^^^). 

From now on, we assume that ||T^i|ls < c and ||C/iT^i||r < c, and prove that (5) holds. 

3.1. Independent case 

Assume that {Ui, Vi) is i.i.d. Assume that ||J7o|lg < c and ||C/o^ollp < c. From the Marcin- 
kiewikz-Zygmund inequality for independent variables, 



The Holder inequality implies that the assumptions hold if ||?7o||(}7 a-nd |1 Vb||gp/(g_p) are 
bounded. 

3.2. Strong mixing case 

Denote by (a,;)j;gN the strong mixing coefficients of the stationary sequence {Ui,Vi)i£f^. 

Proposition 1. Assume that for s>r'> q, ||C/o||r' < c. Relation (5) holds if ai = ©(i^") 
with 



3.3. Causal weak dependence 

Let {Wi)i(z'^ be a centered sequence with values in R''; for > 0, we set Mk = <^{Wj] 
< j <k). For each i S N, we define the 7 coefficients by 



IIA, -D\\,< Cn-^/\ ||iV„ ~ N\\p < Cn-^l^, 



(5) 




7, = sup||E(W^,+fc|7Wfc)||i. 

/c>0 
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Proposition 2. Assum-e that Ui and Vi are two stationary "/-dependent sequences with 
common ji = 0{i~'*). Assume that U and V are independent and that ||?7o||oo < c. Rela- 
tion (5) holds if 

( p s — 1\ g 
' \2 s-p) 2 



3.4. Non-causal weak dependence 

Here, we consider non-causal weakly dependent stationary sequences and assume that q 
and p are integers. A sequence (Wi)j;gN is A-weakly dependent if there exists a sequence 
(A(i))igN decreasing to zero such that 

I Cov(gi(W^,, , . . . , W,J,g2{W,„. . .,W,M 

(6) 

< (wLip gi + w Lip 92 + uwLip gi Lip g2)A(fc) 

for any u-tuple (ii, . . .,iu) and any w-tuple (ji, . . . with Ji < • • • < < + fc < ji < 
■ ■ ■ <jv, where 51,52 are real functions of A^^^ = {gi g A||jgi||oo < 1} defined respectively 
on R^" and R^" {u,v gN*). Recall, here, that A is the set of functions with Lip 51 < 00 
for some u > 1, with 

\gi{yi, ■ ■ ■ ,yu) ~ gi{xi, . . . ,xu)\ 
Lip .91 = sup — ■ — — — ■ — . 

\yi \-\yu-Xu\ 

The monograph Dedecker et al. [9] details weak dependence concepts, models and results. 

Proposition 3. Assume that the stationary sequence {Ui,Vi)i£fi is X-weakly dependent 
for Xi = 0{i^'^). Assume that p and q>2 are even integers. 

Relation (5) holds under each of the following sets of conditions: 



• the processes U and V are independent, ||C/o||oo < c and A > 

• for r' < s, \\Uo\\r' < c and A > 7732 f- 



Remark. 



9 . 



Non-integer moments q€ (2,3) are considered in [17], Lemma 4, and the same in- 
equality holds if E|Zi|«' < 00 with q' =q + 5, and A(i) = 0{i^^) with A > 4 + 2/q' 
for 

g < 2 + i( v/(g' + 4 - 2A)2 + 4(A - 4)(9' - 2) - 2 + g' + 4 - 2A) (< g')- 
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4. Regression estimation 

We now use a measurable bounded function A' : K.'' — > R. Consider a stationary process 
{Xi.Yi) G R*^ X R. We now set, for some h = /i„, 

Ui^n = Ui^n{x) ^ -^k( ^' ^ and Vi^n^Yi, 

where h tends to zero as n tends to infinity. Then i?„ = 'r{x) is the Nadaraya- Watson 
estimator of r{x) = E(Fi|Xi = x). Independently from the dependence structure of the 
process {Xi^Yi), we first introduce the following regularity conditions: 

(Al) for the point of interest x, the functions /, p are fc-times continuously difFeren- 
tiable around x and (p— fc)-H61derian, where fc < p is the largest possible integer; 

(A2) the function K is Lipschitz, admits a compact support, and satisfies K{u) > 
(Vw e R'^) and 



/ 

Js." 



K{u)du=l, 



■ ■ ■ u/K{u) du = if < ^1 + • • • + < fc- 

Moment and conditional moment conditions are also needed: 

(A3) for the point x of interest, there exist r and s, with r < s such that: 

1. ||Fo|U=c<oo; 

2. gr{x) = J \y\'^ f (x , y) dy is a function bounded around the point x; 

3. G{x,x) = supj^ fi{x,x) is bounded around the point (x^x), where fi{x',x") de- 
notes the joint density of {Xo,Xi). 

Remarks. 

• First, notice that the last condition holds immediately for independent sequences 
(Xi) with a locally bounded marginal density. 

• An alternative condition involving local uniform bounds of 

{x\x")^Hix',x") = sup [ \yy'\Mx\y';x'\y")dy'dy'\ 



where fi{x', y'; x" , y") denotes the joint density of (Xq, Vo; Xi,Yi), yields sharper re- 
sults, but such conditions are generally difhcult to check. They hold for independent 
sequences if gi is locally bounded. 

We note here that the assumption K > in (A2) implies that fc = 1 or 2. We are not able 
to control biases by hl^ in Proposition 4 if p > 3. Without this non- negativity condition, 
the moments of numerator and denominator are still controlled, but we definitely cannot 
handle the moments of our ratio. 
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4.1. Moment estimation 

Wc now consider the quantity Sn{x) ~ \\r{x) — r(a;)||p for x £ W^. 

Proposition 4 (Bias). Assuming that f is bounded below around the point x and that 
(Al) and (A2) hold, we have 



r{x) ■ 



Eg{x) 



Ef{x) 



0{hP,) 



We now assume that the numerator and denominator satisfy the usual rate: 

(A4) For the point of interest x and for some q> p, for ft, = /i„ — > and nhf^ — > oo as 
n — )■ oo, there exists a constant c > such that: 

1. \\fix)-Efix)\\,<c/./^; 

2. \\g{x)-Eg{x)\\.,<c/^^. 

Proposition 5. Assuming that f is bounded below around the point x and that assump- 
tions (A2), (A3) and (A4) hold, we have 



\x) 



x) 



Ef{x) 



< C{1 + /i'^^(i/'^-i)(nft'*)(^-i)/2 + (nft'')-"/2„iA)„„ 



with a = q(- — - — -) and B = . . 

Those two propositions imply that the optimal window width h ^ n~^/^^P~^'^^ equili- 
brates both expressions to get the minimax rate (5„(x) = 0(n~''/'^^''+'^^): 

Theorem 2. Choose the window width ft„ = Cti^^/'^''^'''' for a constant C > 0. Assume 
that f is bounded below around the point x and that assumptions (Al), (A3) and (A4) 
hold for 



pd{r — 1) 



V 



pd 



< 



qi — pq — pr qs — pq — ps — 2p 
There then exists a constant C > such that \\r(x) — ?'(a;)||p < Cn^P^^^P^'^\ 



(7) 



We now consider specific dependence structures to get the moment inequalities of (A4) 
for / and g. From now on, fix x and write 



(x) - Eg{x) - 4d 51 = - ^K^Y„K, 



9( 



X,, — X 



4.1.1. Independence 

Proposition 6. Assuming that {Xi,Yi) is i.i.d. and that (A2), (A3) with r^q hold, 
then the (A4) moment inequalities hold. 
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4.1.2. Non-causal weak dependence 

We now work as in [14], except for the necessary truncation used in [2]. 

Proposition 7. Assume that {Xi,Yi) is X-weakly dependent and that (A2) and 
(A3) hold. Assume that < 00 for some s > 2p. If X{i) ~ 0{i~^) with A > 

('r-p)^(r-2pXr- 1') ~ ^ ^^^^T^(l ^ -'-)' ^'^s^ (^4) moment inequalities hold. 

4.1.3. Strong mixing 

Proposition 8. Assume that (A2) and (A3) with r > q hold, and that {Xi,Yi) is a- 
mixing with ai ^ 0{i~°') . If we also suppose that either: 



• « > ((9 - 1)73^) V ^=^^ and h ^ with ad < or 

• p,q are even integers and a > §^r^(l — ^), 

then the (A4) moment inequalities hold. 



Remarks. 

• In the first item, the previous hmitation on h can be expressed as p>d/p + 2 if one 
makes use of the window width h ~ n~i/(2p+d)^ optimal with respect to power loss 
functions; this loss does not appear for integral order moments of the second item. 

• In the case of absolute regularity, Viennet [29] provides sharp bounds for some 
integrals of the second order moments of such expressions. We do not derive them 
here, even if integrated square errors have specific interpretations: we need higher 
order moments in our case. 



4.2. Uniform mean estimates 



We now investigate uniform bounds: 



sup \r{x) — r{x)\ 



(8) 



In this setting, Ango Nze and Doukhan [1] prove the needed results under mixing as- 
sumptions; Ango Nze et al. [4] and Ango Nze and Doukhan [3] provide bounds under 
weak dependence conditions. For this, assumptions and lemmas need to be rephrased by 
replacing Nn — and I?„ — Dn by suprema of those expressions over x £ B for some 
compact subset B CM.'^: 

(A5) The condition (Al) holds for each x £ B. 

(A6) The condition (A3) holds for each x G B. 

/log n 



(A7) For some q > p and w„ 



w„Vlogn, there exists c > such that: 



1- l|sup^g_B|/(a:) -E/(.T)|||5 <cu;„; 
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2. \\sup^^B\g{x) -Eg{x)\\\p<cwn. 
We begin with two preliminary propositions before stating our main result. 

Proposition 9 (Uniform bias). Assuming that f is bounded below over an open neigh- 
borhood of B and that (A2) and (A5) hold, we have 



sup 



ix) 



¥.g{x) 



Ef[x) 



Proposition 10. Assuming that f is bounded below over an open neighborhood of B 
and that assumptions (A2), (A6) and (A7) hold, we have 



sup 



"(x) 



Egix) 



Ef{x) 



< C(l + /i''^(i/'-i)(„/,d)(/^-i)/2 + (n/i'^)-"/2^i/.^)y;„ 



with a = q(- — - — -) and B = _jPJL 



r-p) ■ 



The following theorem derives from the two previous propositions. 



Theorem 3. Let {Xt,Yt) be a stationary sequence. Assume that conditions (A2), (A5), 
(A6) and (A7) hold for some s > 2p. The optimal rate for h is 



\ogn 



l/i2p+d) 



(9) 



sup \r{x) — r{x)\ 



(10) 



The end of the section is devoted to different dependence conditions that are sufficient 
for (A7) to hold. 

4.2.1. Independence 

First, we evaluate the uniform bound for the moments under independence. 



Proposition 11. Let (Xt,Yt)ti£N be an i.i.d. sequence. If we assume that conditions 
(A2), (A5) and (A6) hold for some s>2p, p>dp/{s—2p), then (A7) holds, hence (9) 
yields the bounds 



sup |/(a;) - f{x)\ < Cilogn/nr^^'P+^l 

xeB p 

sup 15(2;) - g{x)\ < C(logn/n)''/(2p+rf). 

xeB p 



(11) 

(12) 
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4.2.2. Absolute regularity 

Proposition 12. Let {Xi,Yi)i^fi be an absolute regular (also called /3-mixing) sequence. 
Assume that conditions (A2), (A5) and (A6) hold for some s > 2p, p > dp/{s — 2p). If 



then assumption (A7) holds, hence the choice (9) yields the bounds (11)-(12). 
4-2.3. Strong mixing 

Using the Fuk-Nagaev inequality in [24] also yields an analogous result. 

Proposition 13. Assume that the process (Xi,Yi)i^tq is stationary and strongly mixing 
with a, = 0(1-°') for a > 3ps+2ds+dp.-jpp-^3dp^dpp y 2^. If we further assume that 

conditions (A2), (A5) and (A6) hold for some s > 2p, p > dp/ (s — 2p), then assumption 
(A7) holds, hence (9) yields the bounds (11)-(12). 

4.2.4. Non-causal weak dependence 

Proposition 14. Assume that the process {Xi,Yi)i^z is stationary and X-weakly depen- 
dent with X{i) = ©(e"'''* ), b > 0. If we further assume that conditions (A2), (A5) and 
(A6) hold for some s > 2p, p>dp/{s — 2p), then assumption (A7) holds, hence (9) yields 
the bounds (11)^(12). 

Remark. Other dependence settings may also be addressed. For example, the ^-mixing 
case considered in [8] and the use of coupling in weakly dependent sequences by Dedecker 
and Prieur [10] both yield suitable exponential inequalities to complete analogous results. 

5. Proofs 

In the proofs, C > is a constant which may change from one line to another. 

5.1. Proof of the main lemmas 

Proof of Lemma 2. Setting z ~ — _D„)/Z?„, we rewrite 



we assume that the mixing coefficients satisfy f3i — 0{i ^) with f3 > 



sp-\-{2s~p)d 
p{s — 2p)—pd 



V(l + f), 




(13) 



But - 1 = 



z + r^—^ , hence for a e [0,1], 



\~z 



1 
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which implies that 



g»|A„| < |iV„ - iV„| + \Nn\ ^'"^ + \Nn 



Dr. 



\Nn\ 



< \N,,~N,,\ + ^\Dr,-D,,\ 



Dn 



|5„-Z)„|i+« 



\l+a 



\Dr, 



< \Nn - iV„| + - A.I + max \V,J3l^^ 

D„ l<i<n \Dn\ 



Proof of Lemma 1. From the preceding relation, we have 



, ig„-A.r+° 

max \Vi^n\ fT^-f- 

l<t<n I All 



(14) 



From the Holder inequality with exponents l/a + 1/6 = 1, we have 



Dn 



\Dn-Dn\ 



^n 



Dn Dn 1 1 p6 ■ 



Now, the assumption ||?7i,„Fi_„||r < C„ implies that |j-/Vn||r < Cn- The second term in 
the right-hand side of inequality (14) is bounded using the property IjA'^nll pa < \Nn\ + 
\\Nn — NnWpa- Consider now some (3 E [0, 1] and u,v >0 such that 1/u + 1/v = 1, to 
be determined later. Then \Nn~ Nn\^ \Nn - Nn\^\Nn ~ Nn\^~^ ; the Holder inequality 
implies that if we choose upa{l ~ (3) =p and vpaP = r, then 

ll^n - NnWpa < ||iV„ - iV„ 1 1 1 1 iV„ - Nn^^ 

< \\Nn-Nn\\l-^\\Nn-Nnf, 

thus ||iV„||pa < \Nn\ + vj-l'^Nnl''^ + C/^). Setting b = q/p, we derive a = {q-p)/q, hence 
:S = ^(1 -/?) and i = 7(1^/3. With the relation 1/u + l/w = 1, we find /3 = pr/q{r - p). 
Then 



J^IIA. - DnW,, < -^{\Nn\ + vlr^{\Nn\' + ))«„■ 



The last term in relation (14) is more difficult to handle; it may be bounded using the 
Holder inequality with exponents 1/a + 1/6 = 1 and 



max I Vi . 

l<i<n ' 



< 



1 



max |K:,n| II I A - Dn 



1 + Q| 



\pb 
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, i/(pa) 



if g > pb(\ + a) or, equivalently, if q — p(l + a)> q/a. We use an argument of Pisicr [22]: 
if if : IR+ — >■ IR+ is convex and non-decreasing, then 



(Emax|K:,„r") < IE(^(max |K,,„|P'' 



Hence Emaxi |yi.„|^° < (nc)^"/* with (^5(2;) = x^^p"-. Now, the bound in the right-hand side 
of (14) can be expressed as w^+"(nc)^/*/|D„|" if s >pa; for this, we use 1 — ^(l + a) > 
i > 2 if a > is small enough with i > + 1. □ 

a — s ^ p — q s 

Proof of Corollary 1. The proof of this result is standard. Namely, p' <p implies from 
the Markov inequality that for Z„ = the sequence {Zn)P is uniformly integrable. 

Set fk{z) ~ \z\P l{\z\<k} - From convergence in distribution, 

EfkiZn) ^ EA.(Z) 

and this occurs uniformly with respect to k from uniform integrability. □ 



5.2. Proofs for Section 3 



Proof of Theorem 1. Set A„ = i?„ — R and refer to Lemma 1: here r and s are such 
that = 1, and a — 2/s. Because w„ = Cn"^/^, we get 

D\\AJL< 1 + 2 \ \ ]vn, 

where the last term in the parenthesis is bounded with respect to n, implying that 

\\Rn-R\\p^O{l/V^). □ 

Proof of Proposition 1. Set Zi — UiVi — EUiVi. Let a^^{u) = J^iyo^i"^ "^i}' ^^'^ 
denote by Qz the generalized inverse of the tail function a; n- P(|Zo| > x). From heredity, 
the mixing coefficient a^. of the sequence Z is bounded by a^. Theorem 2.5 in [24] shows 
that 



l/p 
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But, as ||Zo||r < c, we have E|Zo|'' = Q^zi"^) < c*". Define a = r/p and b = r/{r — p). 
Then, by the Holder inequahty, 



< 



l/iap) 



From a > pb/2, the first integral is finite and || A'",i — A^||p < C/y/n. For = Un — EC/n 
and b' ^r' / [r' — q), we similarly get that ||Z?„ — D\\q < C/y/n. □ 

Proof of Proposition 2. If U and V are independent sequences, then {Yj) = {Ui — 
KUi) is 7-dependent with the same coefficients, as is (Zi) = {Ui — EUi){Vi — EVi), since 
||E(Z,;|A^o)||i < \\E{U^\Ma)-W^\\l\\E{V^\Ma)-EV,\\l < 2c7i. Denote by Gz the inverse 
of X i-^- Jq Qz{u) du. Corollary 5.3, page 124 in [9] states that 



< V 2pn 



\Z\\i 



ij~\u) A nf^'^QPf^ o Gz{u) du 



i/p 



As U is bounded by c, E\Z\' = Q'f^ o Gz{u) du < c^ 

Define a~ (s — l)/{p— 1) and b = {s — l)/{s ~ p). Then, by the Holder inequality, 



<v/2^( / (7^^(w)Anf''/^dw 



i/(&p) 



Q7^oGz(m) du 



l/(ap) 



Because 7 > the first integral is finite and ||iV„ — A^Hp < Similarly, 

\\Y\\l 



< ^2qn 



1/9 



(7-i(u) A n)''/'Q|ri o Gy(u) duj 

< y^{2cy-^ (^J^ A 71)''/^ du^ 

Because 7 > §, the first integral is finite and ||Z3„ — < 

Proof of Proposition 3. Let Z, = UiV^ - EC/,K . 

• Define = [Zi-^ , • • ■ , Zi„)- If and V arc independent, then 

Cov(gi(Z,),.g2(Zj)) = Cov(E(.gi(Z,)|;7),E(.g2(Z,/)|;7)) 
+ ECov(.gi(Z/),.g2(Zj)|f/). 
If we define gi{ui) = E(pi(Z/)|C// =^7), then 

1.91 («/) - 51 K) I < Lip.giE|Fo| E - <\ 



□ 
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so that gi{Ui) is Lipschitz with respect to {Ui,Vi). We have 

CoY{giiZi),g2{Zj)\Uiuj ^uiuj) ^Cov{g,iVi),g2iVj)). 

Here, gi is a Lipschitz function with coefficient maxig/ Lipgi. From (6), we con- 
clude that (Zi) is also A-weakly dependent with Xz{k) < (c A EVb)fc^'''. From Theo- 
rem 4.1, page 77, and Proposition 13.1, page 293 in [9] extended to A-dependence, 

n 
i=l 

The second term is negligible as n tends to infinity and the first sum over k is 
bounded so that || J27=i < Cn^/^ ^^^^ ^^ms ||iV„ - A^H, < 
• In the second case, ||J7o||r' + ll^llr' < oo and Proposition 2.1, page 12 in [9] implies 
that Xz{k) < Cfc-«'-'-2)^)/'-' . □ 



< 



C(2g-2 




k=0 



5.3. Proofs for Section 4.1 

Proof of Lemma 4. The previous convergences Ef{x) — >■ f{x) and ]Eg(x) — >■ r{x)f{x) 
are controlled by 0{hP^) under p-regularity conditions (Al) (see Ango Nze and Doukhan 
[3]). Write 



r{x) — 



E?(.t) 



E/(x) 



^ \g{x) -¥.g[x)\ ^ |je^(^)|M^WM, 



f{x)¥.f{x) 



Since / is bounded below by around x, E/(a;) is also bounded below by and we get 
the result. □ 



Proof of Proposition 5. Condition ((A3)-l) gives ||Vi_„||s < oo and |]?7i,nVi,„|j,- 
0{hd{i/r-i)^ for r < s whenever ((A3)-3) holds. 
Indeed, ||Fi,„||s = ||Fo||s = c < cxo, so, for r < s, 

E|[/,,„y,,„r < / I h~''''K''{{X,^x)/h)\y\^f{x,y)dxAy 

< I h-'''^K'\{X,-x)lh)gr[x)Ax 



< brllooy h-'-''K'-{{X,-x)/h)dx 

Set Kh„{-) = h~'^K{-/hn) and denote by * the convolution. Here, Dn = Ef{x) =E[/i_„ = 
/ ★ Kh^{x) — !> /(a;), the marginal density of Xq, and Nn = Eg(a;) = EJ7i^„Vi,„ = {rf) ★ 
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Kfi^{t) — > r{x)f{x). From Lemma 1, we get 



Dr. 



r[x) 



Ef{x) 



< 

p 



Dn D,, Dn 



where Z3„, -/V„ and c„ are equivalent to constants, C„ = Ch'"-^^^^ ^' and f„ = C{nh'^) 
Substituting the orders in the different terms, we obtain the result. □ 

Proof of Theorem 2. From the preceding propositions, we get 

\\?{x) - r{x)\\p < ChP„ + C{1 + + {nh'^)-'^^^n^/^)v„. (16) 

Note that hf^ — Cvn- The expression in parentheses is bounded if 

r a + 2p s 

These conditions correspond to (7). □ 

A bound of interest which does not use dependence conditions 

The proofs of the propositions under different kinds of dependence make use of a common 
bound that holds in all cases. For a positive integer k, we define the coefficients of weak 
dependence as non-decreasing sequences {Ck.q)q>2 such that 

Cfc,, = sup |Cov(Z,, • • • Z,^,Z,^^, ■■■Z^J, (17) 

where the suprcmum is taken over all {ii, . . . , iq} such that I < ii < ■ ■ ■ < iq and where 
m, k satisfy im+i — im ~ k. Independently of the dependence structure, we get a bound 
for Ck,q. 

Lemma 3. If we assume (A3) and (A4), then Ck,p < Ch'^'^'-'^P^/". 
Proof. Define {ii, . . . , ip} as a sequence that attains the sup defining Ck^p- 

Ck,p = \MZ,,---Z,J 

<2PE\Y,,K,,---Y,^K,J 

< 2P(E|1^, •••l^;J^/f)f/^(maxE(/C,,if,J^/(^-P)^'"''^' 



<2Piiroii?(y J K^/(^-p)(^^y^^/i^-p)(^^y(u,t)dudt^ 

< 2P\\Yo\[Ph^'^'^^-P^''l 



l-p/s 
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The claim in the remark following (A3) is based on the fact that there is no need to 
use the Holder inequality if H is bounded around {x^x) and thus Ck,p < Ch^'^. Now 
Ck.q < Ch'^'^ also holds for the denominator (we may also set = 1). □ 

Proof of Proposition 6. From the Rosenthal inequality for independent variables, 
there exist constants Cg > depending only on q (see, for example, Figiel et al. [19] for 
more details concerning the constants) such that 



E 



<c„ 



q/2 



Here, ¥.Zl < 2«E|Xjy,|«. In the beginning of the proof of Proposition 5, we get E\K,Y,\'i < 
llffglloo||A'||«/i^ and we deduce E|X;^*|'<C((n/i'')9/2+n/i'*), and 

C 



\\g{x)-Egix)\\q< 



The case of the denominator is obtained by setting Yi = l. 



□ 



Proof of Proposition 7. We first establish a Rosenthal inequality for weakly dependent 
variables. For any integer p>2, E(^"^p Z^y < plAp, where 



Then 



p-2 



p^i- 



(18) 



(19) 



1=2 



To bound the sum with coefficients Cfe.p, we shall use the following lemma. 

Lemma 4 (Doukhan and Neumann [15], Lemma lO-(ll)). If we assume that the 
stationary sequence {Zn)nez is X-weakly dependent and satisfies fi = ||^o||r < 1 for some 
r > p, then 

Ck,p < 2P+3pV^^"'^/*''"'^A(fc)('^-f)/('-2). 

Recall that if {Xi,Yi) is A-wcakly dependent with A(fc) < Ck~^, then Zi is A-weakly 
dependent with A(fc) < Cfc^'^'-''^^^/'' (see Proposition 2.1, page 12 in our monograph [9]). 
From Lemma 4, we get Ck,p < C/i''(P-i)/('^-i'fe-^(i(P)/'''. 

We define conditions on the dependence coeflacients that ensure that the sum to control 
is of the same order of magnitude as its first term, that is, 0{h'^). From Lemma 3, for 
any < 6 < 1, 
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< Ch'^ X fc-^K'-p)/'-/jd(-K'-p)/('-i)+(i-'')(i-2p/s))^ 

Choosing b < 2i^(s-^)+2p~s ' exponent of h in the parentheses is positive. Because 
> i ■ ^^r-'p' ' ^'^^ sum over fc (in inequahty (19)) converges and is less than a constant, 

say Op. We get Ap < apuh"^ + Y^^iZ2 ^i^p-i ^-^d from A2 < a2nh'^, we deduce by induction 

on p that Ap < Cpinh'^yl'^ for a sequence Cp which may also depend on x. Hence, |lg(x) — 

'm{^)\\p<Cp(x){nh'')-"Pl^. 

The case of the denominator is obtained by setting Yi = \. In this case, we also note 

that the bound in Lemma 3 can be expressed as Ck^q < Ch?"^ , since a Holder inequality 

is no longer needed. 

<c;i'^xfc-^''/i(-''('^-2)+(i-^)'^). 

Choosing b < d/{2d~ 2), the exponent of h in the parentheses is positive. Because A > 
q — 1/6, the sum over k in inequality (19) converges. □ 



Proof of Proposition 8. • Under the first set of conditions, Rio [24], Theorem 6.3 
states the following Rosenthal inequality: 

I IP / \^^^ 

^\Y.^ <ap(^l]5Zl*^°^(^*'^j)lJ +nbpj (a~^(u) An)^"^g|(u)du. 



We use Lemma 3 to prove that the first term is 0{{7ih'^)P^^). From the Davydov inequal- 
ity, we get a second bound for the covariance: 

I Cov(Zo,Z,;)| < daf^/'^WYoKorr < Cat'^^li^'^'/^-'K 



Var(Zo) - /i'^.g2(a;) / K^{u)du 



Hence, 



^5^Cov(Z„Z,) - ^Var(Z,) < nh^'Y^at'''^'' h'^'^'^^-''^ A h^'^'-'^^l 

i j i i 

Thus, considering some < < 1, 

J2a]~^^''h'^'-^/''~^'^ A < ;jd(2b(l/r+2A-2) + l-4/s) ^^6(l-2/r)_ 



This last term tends to if 6 > 4g^!^2s-4r ^'^'^ b(r-2) ■ This is possible if a > 

r(4sr— 2s— 4r) 
(r — 2)(sr— 4r) ' 
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Consider the second term and apply the Holder inequality with exponents r/[r ~ p) 
and r/p: 

rl / rl \ {i'-p)/r 

n J {a-\u)AnfQPziu)du<nU (a^^ duj \\Z\\p. 

The first integral is convergent as soon as a > ^^^Zp"^ and, from assumption (A3)-3, 
||Z||;: < C/i''(i-'^) so that this second term is negligible if n/i'^P^i"'')/'' < Cinh'^y/^. Hence, 
if the sequence 

(2-p)^dp(2-3r) bounded, we obtain the desired bound. 
Now, consider the denominator, 

^ ^ Cov{K,,Kj) - Var(A',) < nh'^ ^ a^h-'^ A h'^. 

i j i i 

Thus, J2i Ah'^< /i^'^i-^b) for < < 1. This last term tends to if & > 1/2, 

which implies that a> 2. 
Consider the second term. 



{r-q)/r 

WKoWl 

\Jo / 

The first integral is finite if a > ^^^1^"^ ■ Analogously, the second term is negligible as soon 
as n^(^^i^h'^i(^^'^) is a bounded sequence. 

Hence, if ft, ^ n^°, a monotonicity argument shows that the previous bounds require 

O'^^ 33277- 

• Under the second set of conditions, we use the idea from the proof of Proposition 7 
(this idea was initiated in [16]). We again use relations (19), and expression (18) is 
bounded by using the alternative bound of Ck,p, which can be expressed as \EZi-^ ■ ■ ■ Zi^ \ 
for a suitable sequence ii < ■ ■ ■ < ip with iu+i — iu = k. The Davydov inequality (see 
Theorem 3(i) in [13]) and the Holder inequality then together imply that 

Cfc,p < Ga^^ni^n • ■ • Z^^\r/u\\Z^,,^, ■ ■ ■ Z,Jr/(p^u) < 6(4:^'' \\ Zo\\P 

and Ck,p < Ch'^{al'''h-'^) A ft'^(i-2p/«) < c'/jdcifc^/''/i'^(i-2p/«-2''(i-p/(2^))) from Lemma 3 
if < & < 1. Then, setting b= {s - 2p)/ {2{s - p)), 

nY{k + lY~^Ckp^O{nh'^) if a>-^^f l^- 
to ' 2s-p\ p 

The case of the denominator is exactly analogous and here we replace —phyq, s by cx3 
and, in order to let the previous condition unchanged, we replace r by r' with §"(1 ~ ^) = 
r£^(l_i). □ 

2 s—p ^ "p ' 
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5.4. Proofs for Section 4.2 

Proof of Proposition 9. The previous convergences E,f{x) f{x) and Kg{x) — >■ 
r(x)f{x) arc uniformly controlled by 0{hl^) under p-rcgularity conditions (A5) from the 
continuity of derivatives over the considered sets and a standard compactness argument. 
The proofs do not use the positivity of K so that arbitrary values for p are possible; 
see Ango Nze and Doukhan [3] . Note that ii V 3 x denotes an open set over which the 
previous assumptions (Al) hold and such that infs / > 0, then for each open set W with 
W CV, the previous relation holds uniformly over W. Hence, under (A5), if V denotes 
an open set with B cV such that the assumptions (Al) still hold, the bounds for biases 
hold uniformly over B. We thus proceed as in Proposition 4 to complete the proof □ 



Proof of Proposition 10. From Lemma 2, 

Eg{x) 



inf Dn{x) sup 
2;es ^fzB 



r{x) 



Ef{x) 



< sup|7V„ - Af„| +sup 

B B J^n B 



sup \Dn- D„ 



max I Yi I sup ■ 

l<i<n Q 



\D„~D, 



ll + Q 



As in the proof of Lemma 1, substituting the supremum to the variables, we get 

Eg{x) 



inf Dn{x) 
xeB 



sup 

xeB 



r{x) 



Ef{x) 



< 



f supg|jV„| ,np s\N,-^Pwlr^ Ctwj-P <c„nVn 
^ ^ infsA, ^ infsA. iniBDn. inf s / 



where D„, Nn and c„ are equivalent to constants and C„ = Ch'^^^^^ ^\ Substituting the 
orders to the expressions gives the result. □ 

Truncation and variance estimation 

This paragraph introduces some common elements of proofs of Propositions 11-14. 

Let M > and consider the truncated modification of y^, li = < M} — 

Ml{Yi < -M} + Mt{Y, > M}. Define g{x) = -^YT^=x^i^^■ Then, from the Markov 
inequality. 



sup \g{x)-g(x)\ 



With the choice (9), in order to conveniently bound this term, we assume that 



(20) 
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Let Z{x) = V nh'^\'g{x) — Eg(x)|. Below, we will need (uniform) bounds of 

1 _ _ 2 ""^ ~ ~ 

VarZ(.T) ^^Y. {n~V\)CoM{YoKo,Y,K,) < ^ | Cov(roifo, i^^i^OI- 

|i|<n 2=0 

Set Vi{x) = I Cav{YiiK[),YiKi)\. A first bound of Ti{x) for i 7^ comes from Lemma 3: 
there exists a constant Ci such that 

For independent random sequences, ro(a:) is the only non-zero term. 

Tq{x) ^Ch'^g2{x) j K'^{u)(1u around the point x. (21) 

Proof of Proposition 11. In order to check assumption (A7), from the Bernstein 
inequality for independent bounded variables, we get 



P(Z(2;) >u) <2exp • (22) 

V 2{\&v{Z{x)) + 2Mu\\K\\^/{i^/nhd))J 

From (21), Var(Z(a;)) is bounded by a constant. Because K is a Lipschitz kernel, x 1— >■ 
Z{x) is a Lipschitz function and 

\Z{x) - Z{y)\ < 2V^Mh-^'^+^^ Lip/-i:||a; - y\\i. 

Let Mk be the size of the support of K and for S < Mxh, let (i?j)j=i,...,i/ be a regular 
partition of diameter d over B; denoting by Xj the center of Bj, we get 

cS 

sup \K,{x) - K,{xj)\ < -rt{\xj - X,\ < 2hR). 

Write g{x) ~ £5(2;) = g{x) - g{xj) + E(g{xj) - g{x)) + g{xj) - ¥.g{xj). 
For X E Bj, 

\g{x)-g{xj)\ < {c5/h)g{xj), 

where 

5(^) = ;^El^'|l{l^-^^l^2/ii?)} 

so that \g{x) - Eg{x)\ < {cS/h){g{xj) +Eg{xj)) + \g{xj) ~Eg{xj)\. 
Letting Z{x) = Vnh^\g{x) — Eg(x)\ and Zj ~ sup^^^^. Z(x), we have 

\Z,\ < -j^\Z{x,)\ + Eg{xj) + \Z{x,)\. 
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Note that since Eg(x) tends to gi{x) = J^^^^j^\y\f{x,y) dy < gi{x) as h tends to 0, 

2cSEg{x)/h < t/{i\fn}?) holds if i5 < c'th/Jnf? for a suitable constant c' > 0. This 
condition holds for the following choice (considered only for large values of i > 0): 

Ch 

5 = ^ and Q<to<t. (23) 

Let Z = sup^g^ Z{x). For t>to, 



HZ>t) =pf max 
\i<i<i/ 



< V max P(Z, > t) 

1<J<U 



<.max^|p(^|Z(.,)|>0+p(|Z(.,)l>^ 



< I. max <j P( \Z{x,)\ >^]+n \Z{x,)\ > -]\ (24) 



^^'^^"p(- l + tMlL^)-vO (25) 

for some a > (from the relation — > oo, we assume that h/cd >1 in order to derive 
relation (24)). Now, 



/>oo 

EZP<TP+p F{Z > t)tP~^ dt. 

JT 



Choose T = y/ A logn, note that the function u n- cxp(— j^^) is non-increasing and as- 
sume that M{nh'^)~^/'^ < T^^ . That is, with the choice for h 



M<A-^^^h^P, (26) 



we derive 



EZP <TP + Apv I tP-^ exp 
Jt 

/>oo 

< TP + Apiy / tP-^exp 
Jt 



at 



atT 



1 

' dt 



dt 



2 

and using the incomplete gamma function expansion for x>1p 

poo 

uP-^c-''du<2xP-^e-'' 
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and setting u = aiT/2 in the previous inequality, we obtain 

2 \ P 



EZ'P <TP + Apv 



aT 



With 1/ ^ 6 and relation (23), the second term is negligible with respect to the first one 
if A is chosen large enough. Then W^Z^ ~ ©((logri)^'/^) and = \J nh'^\ sup^g^ 15(2;) — 
I^5(^)lllp = ^((log"-)^^^)- We check that conditions (20) and (26) on M are compatible. 
This holds if p > dp/ (s - 2p) . □ 

Proof of Proposition 12. First, we define a truncation level M satisfying (20) and 
consider the truncated variable 'gix). A strong coupling argument by Berbee (1979) yields 
a Bernstein-type inequality. With Theorem 4 of Doukhan [13], we recall, analogously to 
(22), that there exist some 0, A, /U > and an event (which does not depend on either 
X or u) with 



\{^Z{x)>u)^An) < 4exp 



2(Var(Z(a;)) + 2Mqu\\K\\oc./{SV^)) 



(27) 



with P(v4^) < We first check that Var(Z(a;)) is bounded by a constant independent 

of X. 



n-l 



|Var(Z(x))-rt(x)|<^5]r,(a;) 



(28) 



As /3-mixing conditions do not improve the bound of the variance, we use a strong mixing 
condition and the relation < A , and refer to the section dedicated to strong mixing. 
Now, inequality (27) is exactly (22) with qM substituted for M. Following along the lines 
of the proof for the independent case, we get 



^((Z>t)nA„)<4i/exp - 



l+<gA/(n/i'i)-i/2 

The condition (20) on M can now be expressed as qM < h^^A^^^^ and the end of the 
proof remains unchanged. We get 



sup \g{x) -Eg(x)| 

xeB 



<C 



los 



Then, using the trivial bound EZ^l^^. < ||2'||p,P(A^), 
Vnh^ 



sup \gix)-Eg{x)\ 



<Csl^^ + 2M\\K\\ 



(29) 



Defining q = n'^ , with < 7 < 1, compatibility of the inequalities concerning M re- 
quires that n-^^h-P > h-P^P+'^^/^''~P\ This holds if 7 = ^2p+df{7-p} ■ Choosing M = 
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Ch ^P/'** p))ip+d-) ^ second term of (29) can be expressed as 

^^^^ ^ ^ log(n)(p/(^-rt)(p+^) ^^_^^^^^,^^_^^^^^^,^_ 
Vnh^ Vnh^ 

This is neghgiblc with respect to the second term as soon as /? > p('g'l^2p)^pd ■ '-' 

Proof of Proposition 13. As in the independent case, we choose a truncation level 
M = {^)iP+d)p/{(2p+d)(s-p)) satisfying relation (20) and define Z{x) with respect to 
the truncated process g. The Fuk-Nagaev inequality leads to 

n./^/ , , / Y'^^ 16nM||iir||oo / uV^ 
P(Z(x)>ii)<4 1 + — — — — — + " " a 



16rVar(Z(x)); V4M||if|| 

We bound Var(Z(a:)) above by a constant independent of x, using the bound of Lemma 
3 and T,{x) = | Cov{YoKo,Y,K,)\ < 2 J^' Q^^(t) dt < 2a^M\\K\\^, and by adapting the 
proof of Proposition 8. Then, following the proof for the independent case, we get 

^2 X -r/2 . ^ 1+a 



>t)<CQv[l + +C2vn[—=] r" 

for suitable constants cq, c\ and C2. We choose T = \/ AXagn and r = bT^. Then, 

fnyp/(2p+d) ^i^l ^Yiis choice, = y^(-n_)dp- p{s-2p) / {{2p+d){s-p)) 

foo / ,2 \ -bT^/2 

EZP<TP + coi^l + dt 

poo 

1 / 2 — Q , 



log" 
nh'' 



T 



MT ^ 



+ C2b°'i'n[ ^= T"-' / tP-'-'^dt. 
V Vnh'^ J Jt 

Changing the variable u = t/T, the second term equals 

/oo / ,,2 \ —bT'^/2 „oo 

uP-^(l + :^\ du<cov{c,bf^''^TP j uP-i-^^'du 



<co- 



&r2-p + i 



Setting b~ l/(eci), (cib)''^ < n ''^/^^ it is negligible for a suitable choice of A. 
The third term is less than 4iill^;ynTP-2(^/iogn)(i+")(''P-P('*-2p))/((2p+d)(s-p)) ^-^^^1 

vn = n(n/logn)''(''+i)/(2p+'i)(logn)'*/2. Thus, if dp > /9(s-2p) and 1 + a > ^'''tp-t(s~2p7^ ' 
then the third term is negligible. The end of the proof follows along the lines of the in- 
dependent case. □ 
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Proof of Proposition 14. Choose M = (n/logn)(''+'')p/((2p+d)(s-p)) ^nd define Z{x) 
as in the preceding paragraphs. Proposition 8 and Theorem 1 in [15] imply that 



>u) <cocxp 



Var(Z(a;)) + {Mu\\K\\^/Vn}?fl^<'+^) 



for suitable constants Cq and ci. We first check that Var(Z(a;)) is bounded by a constant 
independent of x. The generic term in (28) is bounded by using weak dependence and 
the fact that the function u ^ K{{x — u)/h) is C//i-Lipschitz: 

T^{x) <c(^^^ A(^) < C{h-^-^P+'^^P'^'~^^ + h-^)\{i). 

Considering some < a < 1, up to a constant, the right-hand side of (28) is bounded 
above by 

^h'^h (/i-(''+^) A(i)) < /i<i(i-2")-f'" ^ A"(i), 

i i 

where, in the previous relation, 6 = 2 or 6=1 + (p + d)p/{s — p), respectively, if s > 
{p + d + l)p 01 s < {p + d + l)p. Taking a < d/ [2d + b) and noting that '^"(*) ^ 
the corresponding sum is negligible; thus, Var2'(a:;) 52(2:) J K^{u) du. 
Following along the same lines as in the independent case, we then get 



1 + (tA/(n/i'^)-i/2)6/(b+2) 
for some a > 0. Choosing T = ^/ Alogn, MT jsj nh'^ < 1, we have 



00 



T 



<TP + Apv p tP-' exp (^- °^ ^ ^(b+4)/(b+2)j 
and then setting u = aT''^^'''+^' ^(b+4)/(b+2) ^ -(-j^g second term of (30) is less than 
- y /•°° (f,+2)p/(h+4)-i „^ 



(30) 



/9\ 2p/(b+4)-l 
< 4pz^ f - j 2.(b^+4b+8)p/((b+4)(&+2))-l^-aA/2_ 

With v ^ S^"^ and relation (23), the second term of (30) is negligible with respect to TP 
if A is chosen large enough. The remainder of the proof is the same as in the independent 
case. □ 
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